From Learning to Choosing: How Decision-Making Evolves with Experience in Rats

Decision-making is a fundamental process that guides actions by selecting between various options based on their known or presumed outcomes, often using sensory inputs (Carandini and Churchland, 2013). The neural mechanisms by which the brain integrates complex information to make decisions are typically studied by measuring neural recordings, response times, and choice accuracy using well-trained animals (Carandini and Churchland, 2013). These experiments often employ two-alternative forced-choice (2AFC) designs, where animals are trained to choose between two stimuli presented simultaneously, assuming that the animal learns the value of each stimulus and decides based on an internal comparative evaluation. However, the traditional 2AFC design may be limited, as it does not consider that stimuli are not encountered simultaneously in nature and that decision-making strategies evolve during learning (White et al., 2024). Kacelnik et al. (2011) highlight these concerns, arguing that animals ’ choices on a 2AFC task could be predicted by latencies observed when animals are provided with a single offer. Furthermore, they suggest that the act of deliberation, observed when animals slow their response times to make a choice, could be an artifact produced as animals learn the 2AFC. The recent study by White et al. (2024) in eNeuro aimed to address these limitations by investigating decision-making dynamics during the initial learning phase. By isolating learning values of individual stimuli from the decision-making process, the researchers sought to understand how male rats can make choices for the ﬁ rst time on a 2AFC and how those choices change with experience. The

dual-offer trials compared with single-offer trials.Although the overall variability in highreward trials decreased with experience, response time variability remained higher in dual-offer trials.
Behavioral data was further analyzed using drift diffusion modeling (DDM), a commonly used computational model within the decision-making field.The DDM assumes that when faced with a binary decision, such as between high and low rewards, noisy evidence accumulates with time favoring one of the two options.When enough evidence is reached for one option, it crosses a threshold, allowing for a decision to be made (Myers et al., 2022).DDM revealed that initial task learning lowered the decision threshold, meaning rats required less evidence to choose between high and low rewards.Interestingly, the decision threshold of the rats was highly correlated with the variability observed in the ExGauss fitting.As discussed by the study authors, this relationship suggests that lowering the decision threshold with experience could form a more reliable decision-making system, resulting in decreased variability in response times.White et al. (2024) provide novel insights into how rats adapt their decision-making processes through learning using a two-stage experimental design, demonstrating that this process does not rely solely on fixed strategies.The differences in latencies observed during the value learning stage between high-and low-luminance stimuli indicate the differentiation between and encoding of reward values during initial learning.When transitioned to the choice learning stage, rats initially exhibited longer response latencies in dual-offer trials compared with single-offer trials, suggesting the rats deliberate when choosing between options of known value.These latencies persisted at an attenuated level over the remainder of the choice learning sessions, suggesting that experience shapes decision-making performance.
Importantly, this finding contrasts with Kacelnik et al. (2011), who suggested that animals do not deliberate when making simultaneous choices but rely on the same processes  2024), separating reward value learning from the decision-making process.During the value learning phase, rats were introduced to single-offer rewards.They initiated trials by nose-poking in the central nose port, which presented a visual stimulus through LEDs on either the left or right side.The visual stimuli were equally split between high luminance (high reward, 16% sucrose) and low luminance (low reward, 4% sucrose).In the subsequent choice learning phase, trials were divided equally among high-reward, lowreward, or dual-offer trials, where male rats had to choose between stimuli of high and low luminance.The results indicated that rats responded faster to high-luminance stimuli than low-luminance stimuli in single-offer trials.Response times were slower in dual-offer trials compared with single-offer trials, suggesting a deliberation process affecting decision-making.Response times decreased with experience, demonstrating the effect of initial learning on subsequent decision-making [adapted from White et al. (2024)].
as when options are presented sequentially.The findings by White et al. (2024) suggest that rats do engage in comparative evaluation when faced with simultaneous choices and that this process evolves during the learning phase.
The observation of deliberation processes in rats marks a significant stride forward in understanding how decision-making evolves with experience.To better elucidate the decision-making processes in different contexts, future studies can alter stimulus complexities and reward levels, introducing rats to varying decision difficulties.Additionally, it is worthwhile to investigate if female rats demonstrate similar results, as prior studies have demonstrated sex differences in the learning period of decision-making (Chen et al., 2021).Evaluating decision-making in this context across sexes and using other animal models could enrich our understanding of decision-making processes across different biological systems.Ultimately, this study underscores the importance of experimental designs reflecting the natural sequential encounter of stimuli and considering how learning may influence decision-making strategies.By employing such approaches, the neural and cognitive mechanisms underlying decision-making can be better understood.

Figure 1 .
Figure 1.Visual abstract of the experimental design and findings by White et al. (2024), separating reward value learning from the decision-making process.During the value learning phase, rats were introduced to single-offer rewards.They initiated trials by nose-poking in the central nose port, which presented a visual stimulus through LEDs on either the left or right side.The visual stimuli were equally split between high luminance (high reward, 16% sucrose) and low luminance (low reward, 4% sucrose).In the subsequent choice learning phase, trials were divided equally among high-reward, lowreward, or dual-offer trials, where male rats had to choose between stimuli of high and low luminance.The results indicated that rats responded faster to high-luminance stimuli than low-luminance stimuli in single-offer trials.Response times were slower in dual-offer trials compared with single-offer trials, suggesting a deliberation process affecting decision-making.Response times decreased with experience, demonstrating the effect of initial learning on subsequent decision-making [adapted fromWhite et al. (2024)].